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RAW. SgQUENCE LISTING DATE: 01/15/2002 

PATENT APPLICATION: US/09/868,195 TIME :^ 12:56:33 

Input Set : A:\GJE-71.ST25.txt 

Output Set: N:\CRr3\01152002\I868195.raw ^ 

3 <110> APPLICANT: Hughes, Martin J G 

4 Santangelo, Joseph D " F Nl T F R P O 

5 Lane, Jonathan D I ^ I l» 81 1^ L/ 

6 Feldman, Robert 

7 Moore, Joanne C 

8 Dobson, Richard J 

9 Everest, Paul 

10 Henwood, Caroline J 

11 Dougan, Gordon 

12 Wilson, Rebecca K 

14 <120> TITLE OF INVENTION: Outer Surface Proteins, Their Genes, and Their Use 
16 <130> FILE REFERENCE: GJE-71 

18 <140> CURRENT APPLICATION NUMBER: US 09/868,195 

19 <141> CURRENT FILING DATE: 2001-06-15 
21 <160> NUMBER OF SEQ ID NOS : 12 

2 3 <170> SOFTWARE: Patentin version 3.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1014 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Streptococcus ag^lactiae 

30 <220> FEATURE: 

31 <221> NAME/KEY: CDS 

32 <222> LOCATION: (1)..(1014) 

3 3 <223> OTHER INFORMATION: 
3 6 <4 00> SEQUENCE: 1 

37 atg aca caa gta ttt caa gga cgt agt ttc tta gca gaa aaa gat ttt 

3 8 Met Thr Gin Val Phe Gin Gly Arg Ser Phe Leu Ala Glu Lys Asp Phe 
39 1 5 10 15 
41 tct cgt gag gaa ttt gaa tat ctt att gat ttt tea get cat tta aaa 

4 2 Ser Arg Glu Glu Phe Glu Tyr Leu He Asp Phe Ser Ala His Leu Lys 
43 20 25 30 

45 gac ctt aaa aaa cgt ggt gtt cct cat cat tat ctt gaa ggt aaa aat 144 
4 6 Asp Leu Lys Lys Arg Gly Val Pro His His Tyr Leu Glu Gly Lys Asn 
47 35 40 45 

4 9 att get etc tta ttt gaa aaa aca tct act cgt act cgc gca gcc ttt 

50 He Ala Leu Leu Phe Glu Lys Thr Ser Thr Arg Thr Arg Ala Ala Phe 

51 50 55 60 

53 aca act gca gca att gac eta ggc get cat ccg gaa tac ctt ggt gca 

54 Thr Thr Ala Ala He Asp Leu Gly Ala His Pro Glu Tyr Leu Gly Ala 

55 65 70 75 80 

57 aat gat att caa ctt ggt aaa aaa gaa tea aca gaa gat act get aag 

58 Asn Asp He Gin Leu Gly Lys Lys Glu Ser Thr Glu Asp Thr Ala Lys 

59 85 90 95 

61 gtt tta gga cgt atg ttt gat ggt att gaa ttc cgt ggt ttt age caa 336 

62 Val Leu Gly Arg Met Phe Asp Gly He Glu Phe Arg Gly Phe Ser Gin 

63 100 105 110 
65 aga atg gtt gaa gag ctt get gaa ttt tct gga gta cct gtc tgg aat 



48 



96 



192 



240 



288 



384 



file://C:\Crf 3\Outhold\VsrI868 1 95 .htm 



1/15/02 



Page 2 of 7 



RAW SEQUENCE LISTING . DATE:. 01/15/2002. 

PATENT APPLICATION: US/09/868,195 TIME.: 12 : 56 :-33 

* * 

Input Set : A:\GJE-71.ST25.txt 

Output Set: N:\CRF3\01152002\I868195.raw 



O D g 




V d X 


VjJ X LL 


Glu 


Leu 


Ala 


Glu 


Phe 


Ser 


Glv 


Val 


Pro 


Val 


Trp 


Asn 




D / 




115 










120 










125 










c. Q rrrri" 


tta 


aca 


gat 


gaa 


t era 


cat 


cea 


aca 


caa 


atg 


eta 


get 


gac 


tac 


ctt 


432 


/ u oiy 


XicU. 


Thr 

X IIX 




Glu 


± X p 


His^ 


Pro 


Thr 


Gin 


Met 


Leu 


Ala 


Asp 


Tyr 


Leu 




7 1 


130 










135 










140 












7 3 act 


ate 


aaa 


gaa 


aac 


ttc 


CfQt 


aaa 


ctt 


gaa 


ggt 


att 


act 


ctt 


gtt 


tac 


480 


7 A Th-r 


X xc 


Lys 


Glu 


Asn 


Phe 


Glv 


Lys 


Leu 


Glu 


Gly 


He 


Thr 


Leu 


Val 


Tyr 




7 S IAS 










150 










155 










160 




77 -hal- 
/ / i_y L- 


y y 


gac 


y y a 


cgt 


aac 


aat 


gtt 


acc 


aac 


tea 


ctt 


tta 


gtg 


get 




528 


/ o L,yo 


Gly 


Ago 


VJ X 


Arg 


Asn 


Ash 


Val 


Ala 


Asn 


Ser 


Leu 


Leu 


Val 


Ala 


Gly 




7 Q 








165 










170 










175 




576 


O 1 3 p-t- 

ox dL. U 


1- trr 


3 t" (T 


y y y 


gtc 


aat 


gta 


cac 


ate 


ttt 


tet 


cea 


aaa 


gaa 


ctt 


tty 


X liX 


Leu 


Met 


vjxy 


Val 


Asn 


Val 


His 


He 


Phe 


Ser 


Pro 


Lvs 


Glu 


Leu 


Phe 




Q "5 






X o u 










185 










190 






624 


85 ccw 


get 


gaa 


gag 


att 


gtt 


aaa 


ttg 


get 


gaa 


gga 


tat 


gee 


aaa 


gaa 


tet 


86 Pro 


Ala 


Glu 


Glu 


He 


Val 


Lys 


Leu 


Ala 


Glu 


Gly 


Tyr 


Ala 


Lys 


Glu 


Ser 




87 




195 










200 










205 








672 


89 ggg 


get 


cac 


gtt 


etc 


gtt 


act 


gat 


aat 


gta 


gac 


gaa 


get 


gta 


aag 


gga 


90 Gly 


Ala 


His 


Val 


Leu 


Val 


Thr 


Asp 


Asn 


Val 


Asp 


Glu 


Ala 


Val 


Lys 


Gly 




91 


210 










215 










220 










720 


93 gca 


gac 


gtc 


ttt 


tac 


act 


gat 


gtc 


tgg 


gta 


teg 


atg 


gga 


gaa 


gaa 


gat 


94 Ala 


Asp 


Val 


Phe 


Tyr 


Thr 


Asp 


Val 


Trp 


Val 


Ser 


Met 


Gly 


Glu 


Glu 


Asp 




95 225 








230 










235 










240 




97 aag 


ttc 


aaa 


gaa 


cgc 


gtt 


gaa 


ctt 


ctt 


caa 


cea 


tat 


caa 


gta 


aac 


atg 


768 


98 Lys 


Phe 


Lys 


Glu 


Arg 


Val 


Glu 


Leu 


Leu 


'Gin 


Pro 


Tyr 


Gin 


Val 


Asn 


Met 




99 






245 










250 










255 






101 gaa ctg att aaa aaa get aat aat gat aat ctt ate ttc tta cac tgc 


816 


102 Glu Leu lie Lys Lys Ala Asn Asn Asp Asn- Leu lie Phe Leu His Cys 




103 






260 








265 








270 




864 


105 tta cct gca ttc cat gat aca aat acc gtt tat ggc aaa gac gtc get 


106 Leu Pro Ala Phe His Asp Thr Asn Thr Val Tyr Gly Lys Asp Val Ala 




107 




275 








280 








285 








109 gaa aaa ttt ggg gtc aag gaa atg ga^ gtt act gat gaa gtc ttc cgt 


912 


110 Glu Lys Phe Gly Val Lys Glu Met Glu Val Thr Asp Glu Val Phe Arg 




111 


290 








295 








300 








960 


113 age aaa tat get cgt cat ttc gac caa get gaa aat cgt atg cac act 


114 Ser Lys Tyr Ala Arg His Phe Asp Gin Ala Glu Asn Arg Met His Thr 




115 305 








310 








315 








320 




117 att aaa get gta atg get gca acc ctt gga aat ctt ttc att cea aaa 


1008 


118 lie Lys Ala Val Met Ala Ala Thr Leu Gly Asn Leu Phe lie Pro Lys 




119 








325 








330 








335 




121 gtt taa 




























1014 


122 Val 
































126 <210> 


SEQ 


ID NO: 2 


























127 <211> 


LENGTH: 


337 


























128 <212> 


TYPE 


: PRT 


























129 <213> 


ORGANISM: Streptococcus 


agalactiae 














132 <400> 


SEQUENCE 


: 2 


























134 Met Thr Gin Val Phe Gin Gly Arg Ser Phe Leu Ala Glu Lys Asp Phe 
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TIME: 12:56:33 
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135 1 



5 



10 



15 



138 Ser Arg Glu Glu Phe Glu Tyr Leu He Asp Phe Ser Ala His Leu Lys 

139 20 . 25 30 

142 Asp Leu Lys Lys Arg Gly Val Pro His His Tyr Leu Glu Gly Lys Asn 

143 35 40 45 

146 He Ala Leu Leu Phe Glu Lys Thr Ser Thr Arg Thr Arg Ala Ala Phe 

147 ,50 55 60 

150 Thr Thr Ala Ala He Asp Leu Gly Ala His Pro Glu Tyr Leu Gly Ala 

151 65 70 75 80 

154 Asn Asp He Gin Leu Gly Lys Lys Glu Ser Thr Glu Asp Thr Ala Lys 

155 85 90 95 

158 Val Leu Gly Arg Met Phe Asp Gly He Glu Phe Arg Gly Phe Ser Gin 

159 100 105 110 

162 Arg Met Val Glu Glu Leu Ala Glu Phe Ser Gly Val Pro Val Trp Asn 

163 115 . 120 125 

166 Gly Leu Thr Asp Glu Trp His Pro Thr Gin Met Leu Ala Asp Tyr Leu 

167 130 135 140 

170 Thr He Lys Glu. Asn Phe Gly Lys Leu Glu Gly He Thr Leu Val Tyr 
.171 145 150 155 160 

174 Cys Gly Asp Gly Arg Asn Asn Val Ala Asn Ser Leu Leu Val Ala Gly 

175 165 170 175 

178 Thr Leu Met Gly Val Asn Val His He Phe Ser Pro Lys Glu Leu Phe 

179 180 185 190 

182 Pro Ala Glu Glu He Val Lys Leu Ala Glu Gly Tyr Ala Lys Glu Ser 

183 195 200 205 

186 Gly Ala His Val Leu Val Thr Asp Asn Val Asp Glu Ala Val Lys Gly 

187 210 215 220 

190 Ala Asp Val Phe Tyr Thr Asp Val Trp Val Ser Met Gly Glu Glu Asp 

191 225 230 235 240 

194 Lys Phe Lys Glu Arg Val Glu Leu Leu Gin Pro Tyr Gin Val Asn Met 

195 245 n 250 255 

198 Glu Leu He Lys Lys Ala Asn Asn Asp Asn Leu He Phe Leu His Cys 

199 260 265 270 

202 Leu Pro Ala Phe His Asp Thr Asn Thr Val Tyr Gly Lys Asp Val Ala 

203 275 280 285 

206 Glu Lys Phe Gly Val Lys Glu Met Glu Val Thr Asp Glu Val Phe Arg 

207 290 295 300 

210 Ser Lys Tyr Ala Arg His Phe Asp Gin Ala Glu Asn Arg Met His Thr 

211 305 310 315 320 

214 He Lys Ala Val Met Ala Ala Thr Leu Gly Asn Leu Phe He Pro Lys 

215 325 330 335 
218 Val 

222 <210> SEQ ID NO: 3 

223 <211> LENGTH: 1197 

224 <212> TYPE: DNA 

225 <213> ORGANISM: Streptococcus agalactiae 

227 <220> FEATURE: 

228 <221> NAME/KEY: CDS 

229 <222> LOCATION: (1)..(1197) 
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RAW SEQUENCE LISTING DATE: 01/15/2002 

PATENT APPLICATION: US/09/868,195 TIME: 12:56:33 

Input Set : A:\GJE-71.ST25.txt 

Output Set: N:\CRF3\01I52002\I868195.raw 

230 <223> OTHER INFORMATION: 

233 <400> SEQUENCE: 3 

234 atg get aaa ttg act gtt aaa gac gtt gat ttg aag gta aaa aaa gtc 48 

235 Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Val Lys Lys Val 

236 1 5 10 15 

238 etc gtt cgt gtt gac ttt aat gtg cct ttg aaa gac ggc gtt ate act 96 

239 Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val lie Thr 

240 20 25 30 

242 aac gac aac cgt ate act gcg get ctt cca aca ate aag tat ate ate 144 

243 Asn Asp Asn Arg lie Thr Ala Ala Leu Pro Thr lie Lys Tyr lie lie 

244 35 40 45 

246 gaa eaa ggt ggt cgt get ate etc ttc tct eac ctt gga cgt gtt aaa 192 

247 Glu Gin Gly Gly Arg Ala lie Leu Phe Ser His Leu Gly Arg Val Lys 

248 50 55 60 

250 gaa gaa get gac aaa gaa gga aaa tea ctt gea ceg gta get get gat 240 

251 Glu Glu Ala Asp Lys Glu Gly Lys Ser Leu Ala Pro Val Ala Ala Asp 

252 65 "70 - 75 80 

254 tta get get aaa ctt ggt eaa gat gtt gta ttc cca ggt gtt act cgt 288 

255 Leu Ala Ala Lys Leu Gly Gin Asp Val Val Phe Pro Gly Val Thr Arg 

256 85 90 95 

258 ggt gea aaa tta gaa gaa gea ate aat get ttg gaa gat gga eaa gtt 336 

259 Gly Ala Lys Leu Glu Glu Ala He Asn Ala Leu Glu Asp Gly Gin Val 

260 100 105 110 

262 ctt ttg gtt gaa aac act cgt ttt gaa gat gtt gac ggt aag aaa gaa 3 84 

263 Leu Leu Val Glu Asn Thr Arg Phe Glu Asp Val Asp Gly Lys Lys Glu 

264 115 120 125 

266 tct aag aat gac gaa gaa ctt ggt aaa tac tgg get tea ctt gga gat 432 

267 Ser Lys Asn Asp Glu Glu Leu Gly Lys Tyr Trp Ala Ser Leu Gly Asp 

268 130 135 140 

270 gga ate ttc gtt aac gat gea ttt ggt aca gea cac cgt get eat gea 480 

271 Gly He Phe Val Asn Asp Ala Phe Gly Thr Ala His Arg Ala His Ala 

272 145 150 155 160 

274 tea aac gta ggt att tea gea aac gtt gaa aaa get gta get ggt ttc 528 

275 Ser Asn Val Gly He Ser Ala Asn Val Glu Lys Ala Val Ala Gly Phe 

276 165 - 170 175 

278 ctt ctt gaa aac gaa att get tac ate eaa gaa gea gtt gaa act cca 576 

279 Leu Leu Glu Asn Glu He Ala Tyr He Gin Glu Ala Val Glu Thr Pro 

280 180 185 190 

282 gaa cgc cca ttc gta get att ctt ggt ggc tea aaa gtt tct gat aag 624 
2 83 Glu Arg Pro Phe Val Ala He Leu Gly Gly Ser Lys Val Ser Asp Lys 
284 195 200 205 

286 att ggt gtt ate gaa aac ctt ctt gaa aaa get gat aaa gtt ctt ate 672 

287 He Gly Val He Glu Asn Leu Leu Glu Lys Ala Asp Lys Val Leu He 

288 210 215 220 

290 ggt ggt ggt atg act tac aca ttc tac aaa get eaa ggt ate gaa ate 720 

291 Gly Gly Gly Met Thr Tyr Thr Phe Tyr Lys Ala Gin Gly He Glu He 

292 225 230 235 240 

294 ggt aac tea ctt gta gaa gaa gac aaa ttg gat gtt get aaa gac etc 768 

295 Gly Asn Ser Leu Val Glu Glu Asp Lys Leu Asp Val Ala Lys Asp Leu 
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RAW SEQUENCE LISTING DATE: 01/15/2002 
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296 
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ggt 
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aaa 
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816 


299 


Leu 


Glu 
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Ser 


Asn 


Gly 
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Leu 


He 
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Val 


Asp 


Ser 


Lys 


Glu 




300 








260 










265 










270 








302 


gca 


aac 


gca 


ttt 


get 


ggt 


tat 


act 


gaa 


gtt 


cgc 


gac 


act 


gaa 


ggt 


gaa 


864 


303 


Ala 


Asn 


Ala 


Phe 


Ala 


Gly 


Tyr 


Thr 


Glu 


Val 


Arg 


Asp 


Thr 


Glu 


Gly 


Glu 




304 






275 










280 










285 










306 


gca 


gtt 


tea 


gaa 


ggg 


ttc 


ett 


ggt 


ett 


gac 


ate 


ggt 


cct 


aaa 


tea 


ate 


912 


307 


Ala 


Val 


Ser 


Glu 


Gly 


Phe 


Leu 


Gly 


Leu 


Asp 


He 


Gly 


Pro 


Lys 


Ser 


He 




308 




290 










295 










300 












310 


get 


aaa 


ttt 


gat 


gaa 


gca 


ctt 


act 


ggt 


get 


aaa 


aca 


gtt 


gta 


tgg 


aac 


960 


311 


Ala 


Lys 


Phe 


Asp 


Glu 


Ala 


Leu 


Thr 


Gly 


Ala 


Lys 


Thr 


Val 


Val 


Trp 


Asn 




312 


305 










310 










315 










320 




314 


gga 


cct 


atg 


ggt 


gtc 


ttt 


gaa 


aac 


cct 


gac 


ttc 


caa 


get 


ggt 


aca 


ate 


1008 


315 


Gly 


Pro 


Met 


Gly 


Val 


Phe 


Glu 


Asn 


Pro 


Asp 


Phe 


Gin 


Ala 


Gly 


Thr 


He 




316 










325 










330 










335 






318 


ggt 


gta 


atg 


gac 


get 


ate 


gtt 


aaa 


caa 


eca 


ggc 


gtt 


aaa 


tea 


ate 


ate 


1056 


319 


Gly Val 


Met 


Asp 


Ala 


He 


Val 


Lys 


Gin 


Pro 


Gly 


Val 


Lys 


Ser 


He 


He 




320 








340 










345 










350 








322 


ggt 


ggt 


ggt 


gat 


tea 


gca 


gca 


get 


get 


ate 


aac 


ctt 


ggt 


egt 


get 


gac 


1104 


323 


Gly Gly Gly Asp 


Ser 


Ala 


Ala 


Ala 


Ala 


He 


Asn 


Leu 


Gly 


Arg 


Ala 


Asp 




324 






355 










360 










365 










326 


aaa 


ttc 


tea 


tgg 


ate 


tct 


act 


ggt 


ggt 


gga 


gca 


age 


atg 


gaa 


ttg 


, etc 


1152 


327 


Lys 


Phe 


Ser 


Trp 


He 


Ser 


Thr 


Gly 


Gly 


Gly 


Ala 


Ser 


Met 


Glu 


Leu 


Leu 




328 




370 










375 










380 
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gaa 


ggt 


aaa 


gta 


tta 


eca 


ggt 


ttg 


gca 


gca 


ttg 


act 


gaa 


aaa 


taa 




1197 


331 


Glu 


Gly 


Lys 


Val 


Leu 


Pro 


Gly 


Leu 


Ala 


Ala 


Leu 


Thr 


Glu 


Lys 








332 


385 










390 










395 















335 <210> SEQ ID NO: 4 

336 <211> LENGTH: 398 

337 <212> TYPE: PRT 

338 <213> ORGANISM: Streptococcus agalactiae 
340 <400> SEQUENCE: 4 

342 Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Val Lys Lys Val 

343 15 10 15 

34 6 Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val He Thr 
347 20 25 30 

350 Asn Asp Asn Arg He Thr Ala Ala Leu Pro Thr He Lys Tyr He He 

351 35 40 45 

354 Glu Gin Gly Gly Arg Ala He Leu Phe Ser His Leu Gly Arg Val Lys 

355 50 55 60 

358 Glu Glu Ala Asp Lys Glu Gly Lys Ser Leu Ala Pro Val Ala Ala Asp 

359 65 70 75 80 

362 Leu Ala Ala Lys Leu Gly Gin Asp Val Val Phe Pro Gly Val Thr Arg 

363 85 90 95 

366 Gly Ala Lys Leu Glu Glu Ala He Asn Ala Leu Glu Asp Gly Gin Val 

367 100 105 110 

370 Leu Leu Val Glu Asn Thr Arg Phe Glu Asp Val Asp Gly Lys Lys Glu 

371 115 120 125 
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